A quantum neural network computes its own 

relative phase 
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Abstract — Complete characterization of the state of a quantum 
system made up of subsystems requires determination of relative 
phase, because of interference effects between the subsystems. 
For a system of qubits used as a quantum computer this is 
especially vital, because the entanglement, which is the basis 
for the quantum advantage in computing, depends intricately on 
phase. We present here a first step towards that determination, in 
which we use a two-qubit quantum system as a quantum neural 
network, which is trained to compute and output its own relative 
phase. 

I. Introduction 

Entanglement is the root of the power of quantum 
computers[lJ; thus, the production and measurement of en- 
tanglement are essential if we are ever to be successful in 
making full use of the potential of quantum computing. This 
turns out to be a very hard problem. 

In previous work, we have proposed a method to find 
an entanglement witness for a general, unknown, quantum 
input state, using dynamic learning to find parameters for the 
quantum system that make it calculate its own entanglement. 
We called this a quantum neural network (QNN) [2|. The basic 
idea is that contained in the system itself is the information 
about its entanglement: If we find, through learning, an ap- 
propriate set of parameters for the system, then it can extract 
the entanglement of its initial state as an output measure of 
the state at some final time. We imagine that our quantum 
system evolves under some Hamiltonian containing adjustable 
parameters; we find that set of parameters such that our des- 
ignated output function (the qubit-qubit correlation function) 
is mapped onto the correct values for the entanglement of the 
initial state. Our entanglement witness gave good results for 
large classes of input states, including both pure and mixed 
states. Unlike the case with any other witness (see, e.g., [3)), 
the input state did not need to be "close" to any particular state. 
We have also (4), extended our work to the 3-, 4-, and 5- 
qubit cases, and found that as the size of the system grows, the 
amount of additional training necessary diminishes; thus, our 
method may be very practical for use on large computational 
systems. 

Figure 1 shows some representative results, in which we 
compare our entanglement witness to the entanglement of 
formation (6) for 50,000 randomly generated states for the 
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2-qubit Pure Slate QNN - Real Coelficients 




Entanglement of Formation 

Fig. 1. QNN entanglement for 50,000 randomly generated pure states of 
the form ooo|00 > +arji|01 > +ciio|10 > +aii|ll >, where arjo, aoi, 
aio.and an are all real, as a function of the entanglement of formation. 
Points lying along the dashed yellow line are states for which the entanglement 
predicted by the QNN witness exactly matches the entanglement of formation. 
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Fig. 2. As in Figure \T\ but with complex coefficients. 



2-qubit system. These are pure states with real coefficients 
on the usual ("charge") basis. The agreement is excellent. 
Unfortunately these results do not carry over to the more 
general case of complex coefficients. See Figure 2, which 
shows a similar set but with complex coefficients. Indeed, as 
we showed [2|, it is impossible to find any single measurable 
which will not exhibit anomalous oscillation; all witnesses do 
so. But is there a way to get around this difficulty? 

There are, of course, ways to determine more information 
about the state; if we know the entire density matrix we 



can, at least for the 2-qubit system, simply calculate the 
entanglement of formation (as we ourselves did to generate 
the comparison data for Figure Q]) For the 2-qubit system this 
may not be unreasonable. But for the eventual goal of a large 
computational system, this can become quite daunting, since 
the number of parameters necessary goes like 2 2N , where N 
is the number of qubits. Perhaps dynamic learning can allow 
us to find a shortcut. This paper is a first step in that direction. 

If we knew or could determine the relative phases {8} of the 
basis states, we could apply the (unitary) phase shift operator 
of {e~ l6 } to each relevant part of our input state. Since the 
coefficients would then be real, we could then perform our 
entanglement witness measurement and achieve results like 
those in Figure Q] 

In 2005, Yang and Han (7] found an algorithm for de- 
termining the relative phase between parts of the n-qubit 
Bell (or GHZ) state, ^O.-.O > +e^-y/T=p\l...l >. They 
showed that performing a Hadmard transform on each qubit 
puts the system in a state in which the probability of finding 
an even number of qubits in the state |1 > is given by 
Peven = \ + y / p(l — p) cos <f>. Given a large number of copies 
of the state, it is then possible to determine both p and (f>. Here 
we show that, with our QNN, we can extend this result, for the 
2-qubit system, in two ways. First, we show that we can also 
find the phase offset in an EPR state, aoi|01 > +e l9 aio|10 >. 
Second, we show that we can also find the phase offset for 
any of the partially entangled states consisting of an EPR or 
a Bell state with some contaminant: 
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II. Dynamic learning: quantum neural network 
(QNN) 

We consider 2-qubit system whose Hamiltonian is: 

H = KaCTxA + K B <J X B + £A<JzA + £B&zB + QOzAOzB (2) 

where {a} are the Pauli operators corresponding to each of the 
two qubits, A and B, Ka and Kb are the tunneling amplitudes, 
£a and £b are the biases, and Q the qubit-qubit coupling. The 
time evolution of the system is then given by the Schrodinger 
equation: 

| = >,P] 0) 
at in 

where p is the density matrix and H is the Hamiltonian. The 
parameters {K, e, control the time evolution of the system 
in the sense that, if one or more of them is changed, the way 
a given state will evolve in time will also change. This is 
the basis for using our quantum system as a neural network. 
The role of the "weights" of the network is played by the 
parameters of the Hamiltonian, {K, e, £}, all of which we take 



to be experimentally adjustable as functions of time (see, e.g., 
J8), for the case of SQuID charge qubits.) By adjusting the 
parameters using a neural learning algorithm we can train the 
system to evolve in time to a set of chosen target outputs 
at the final time tf, in response to a corresponding (one- 
to-one) set of given inputs. Because the time evolution is 
quantum mechanical (and, we assume, coherent), a quantum 
mechanical function, like an entanglement witness of the initial 
state, can be mapped to an observable of the system's final 
state, a measurement made at the final time tf. The time 
evolution of the quantum system is calculated by integrating 
the Schrodinger equation numerically in MATLAB Simulink, 
using ODE4 (Runge-Kutta), with a fixed integration step size 
of 0.05 ns [9|. The system was initialized in each input state 
in the training set, in turn, then allowed to evolve for 190 
ns. A measurement is then made at the final time; this is the 
"output" of the network. An error, target — output, is calcu- 
lated, and the parameters are adjusted slightly to reduce the 
error. This is repeated for each (input, target) pair multiple 
times until the calculation converges on parameters that work 
well for the entire training set. Complete details, including a 
derivation of the quantum dynamic learning paradigm using 
backpropagation iflOl in time ifTTl . are given in J2]. 

We choose the usual "charge basis ", in which each qubit's 
state is given as or 1. 

All of the parameters {K, e, (} were taken to be functions 
of time; in contrast to our earlier work [0, [0], |]5], in which 
the parameters were taken to be piecewise constant in time, we 
have, here, allowed the parameters to be continuous functions 
of time. For the backpropagation learning, the output error 
needs to be back-propagated backward through time ifPTl . so 
the integration has to be carried out from the final time tf to 0. 
To implement this in MATLAB Simulink, a change of variable 
is made by letting t' = tf — t, and running this simulation 
forward in t' in Simulink. 

III. Training of the phase indicator 

In the charge basis, we can write a general pure state of the 
system at time t = as 

|tf (0) >= a 00 |00 > +a ie^|01 > +ai e l6 |10 > (4) 

+a n e^\ll > 

where normalization requires that 

\J a 2 00 + ali + a 2 w + a 2 xl = 1 (5) 

Since an overall phase is physically meaningless we may take 
out any overall phase factor; that is, without loss of generality 
we may take the coefficient of the 1 00 > basis state to be real. 
We then write each of the other coefficients as its magnitude 
times a phase factor; thus, each a nm will be a real number, 
and the phase factor, if any, will be written in explicitly. As 
discussed above, the state of the system evolves under the 
Hamiltonian, Equation |2] to another state, >, at the 

final time tf. At that final time we make a measurement. In 
the terminology of neural network learning: (1) the input to 



the neural network is the initial state 1^(0) > at time t = 
of the quantum system; (2) the output of the neural network 
is a quantum measure made on the final state at the final time 
t = tf of the quantum system; and (3) the trainable weights 
of the neural network are the time histories of the adjustable 
parameters of the quantum system. The network is trained 
on a set of training pairs, each of which consists of (input, 
correct output). Each training pair is presented to the network, 
the output is calculated, the error computed, and the weights 
changed so as to decrease the error [2 |. Each pass through the 
entire training set is called an epoch. 

As with all good science, we began with what was already 
known Q: namely, that it is possible to extract relative 
phase information from the Bell state, \Bell >— a O o|00 > 
+a 11 e 4< *|ll >. Because we are using a learning process, 
it is important to see how much information we can get 
with as little input as possible. Thus, our original training 
set consisted of only 11 training pairs, using only equal 
amplitude Bell states \Bell >= aoo |00 > +a n e^|ll > with 
aoo = an = ;4g> where the phase angle <f> varies from — 7r/2 

to 7r/2 as 4> = — § + io ' f° r n = 1 : 11- The network 
output is the absolute magnitude squared of the projection of 
the final state of the quantum system onto the state |11 >, i.e., 
the probability of the system's being found in the state |11 >. 
The correct, or target, output for these equal-amplitude EPR 
states is taken to be just cos 2 (0/2). That is, the (input,output) 
pairs are 

input = |*(0) >= -^(100 > +e i0 |ll >) (6) 

output = I < > | 2 -> target = cos 2 ((/>/2) 

The network was trained for 10 epochs, on a total of 11 
training pairs. The average RMS error of all 1 1 training pairs 
after training is 0.0127. A plot of RMS error vs epoch is shown 
in Figure [3] A plot of output vs target for the 1 1 training 
pairs is shown in Figure |4] A plot of the trained parameters 
as functions of time is shown in Figure [5] Each is a simple 
oscillatory function. Note that the trained tunneling amplitude 
functions Ka and Kb lie right on top of each other, as do 
ca and es, which is unsurprising given the symmetry of the 
training set. 

To see if the network has generalized (i.e., learned as 
opposed to having simply curvefitted), we then tested (with no 
additional training) on a set of Bell states of random relative 
magnitude, that is, on states of the type \Bell >= aoo|00 > 
+ane l ^|ll > with now randomly generated numbers for 
aoo and an (such that the state remained normalized, i.e., 
V a oo + a ii = !•) F rom 121 we knew that it was unlikely that 
we would be able to train to the same simple target function 
cos 2 (</>/2) , and so it transpired; however, we found that a 
simple analogue, 2(^ — ao ) 2 a 2 i + 2anoan cos 2 (</>/2), did 
work quite well. Again, the measurable is the probability of 
the system's being found in the |11 > state at the final time, 
i.e.,\ < 11 \ty(tf) > | 2 . Note that this target function reduces 
to the target function used for training, when aoo = an = -t=, 




Epoch 

Fig. 3. RMS error per training pair vs. epoch (pass through the training set) 
for the (f> phase offset indicator. The training set of 1 1 (input.output) pairs is 
given in the text. 




Target 

Fig. 4. Results for the training set for the 9 phase offset indicator, showing 
deviation of the output, | < 11 |\P(t/) > | 2 , from the target function 
cos 2 (</)/2). Average RMS error per pair is 0.0127. The line shows the goal 
(perfect agreement.) 
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Fig. 5. The functions Ka, Kb, £A, £b, and C as functions of time, as 
trained for the phase offset 9. Ka and Kg lie right on top of each other, as do 
tA and e s- Each was started out (pre-training values) as constant functions: 
K A = K B = 2.5 X 10- a GHz , and e A = e B = f = lQ~ i GHz. 



and maintains the necessary symmetry. These data are plotted 
in Figure [6] (blue triangles.) As can easily be seen in the figure, 
agreement is excellent. 

The ability of the system to map onto the target function 
depends on its being an entangled state [7 1; however, as long as 
we adjust the target function appropriately, full entanglement 
is, clearly, not necessary. Thus it ought also to be possible 
to find the phase offset for a partially entangled input state, 
e.g., of the form a O o|00 > +a i|01 > +e^au|ll >. How 
do we do this? We consider a probability-weighted target 
function, equal to our earlier targets for the special case of 
the pure Bell state, but adjusting the relative function for the 
diminished entanglement. We are guided here by symmetry 
and by earlier analytic results [7|, in which it was found that, 
while the relative phase was extractable, it was not easily 
separable from the amplitude information, and, in fact, had 
to be separately measured for (hence, the necessity for "many 
copies" of the original state.) Experimentation eventually gave 
us the following (relatively) simple functions. For the Bell 
state, aoo|00 > +ane l *|ll >, the target function for the 
output | < > | 2 is: 
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For the \BP 1 >= a O o|00 > +a i|01 > +ane^|ll > state, 
the target function for the output | < ll\^f(tf) > | 2 is: 



targetBP 1 = 2| 
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For the \BP 2 >= a 00 |00 > +oi |10 > +e^a u |ll > state, 
the target function for the output | < > | 2 is: 
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Note that these target functions agree with the functions 
used for training: that is, the training states -^=[j00 > 
+e l *|ll >], for (p : ~ir/2 to 7r/2, had a target function given 
by cos 2 (</>/2); this is exactly what the training function in 
Equation [7] reduces to, in the case a o = an = -j=- Similarly 
the target functions for the \BP > states both reduce to the 
function tested on for the pure Bell states in the case of equal 
amplitudes of Testing results for 550 randomly generated 
states of all three types, for all values of the angle </>, are 
shown in Figure [6] Agreement is quite good, even remarkable, 
considering that the system was trained only on 11 phase 
angles for an equal-amplitude Bell state. The average RMS 
error per pair over all 550 testing pairs after training is 0. 0270. 

With some confidence in our method, we now extend to the 
corresponding states of the two qubit system that also can have 
maximal entanglement, the EPR states, |\&(0) >= aoi|01 > 
+aioe l6 |10 >. By symmetry, these states are "the same" as 
the Bell states; thus, we would expect that similar training 
ought to be able to map the phase shift to the projection onto 
the 1 01 > basis state. However, with no further training, we 
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Fig. 6. Results for the testing set for the phase offset </>, consisting of three 
types of states: unequal amplitudes Bell states aoo|00 > +e 1 ^ > an |11 > (blue 
triangles), and two kinds of partially entangled states, aoo|00 > +aoi|01 > 
+e^an|ll > (green squares), and a 00 |00 > +ai |101 > +€^0^11 > 
(red circles). As in Figure |4] we show the deviation of the output from the 
target functions (given in the text.) Average RMS error per pair is 0.0270. 
The line shows the goal (perfect agreement.) 



were also able to recover this information! In other words, 
the neural net, trained to map <j> information to the projection 
onto the basis state |11 >, also maps the 8 information to the 
projection onto the |01 > basis state. For equal amplitudes, 
o-oi — o-io — we again use the simple cosine function, 
cos 2 (6*/2); we use the analogous measure on the final state, 
| < 10\^(tf) > | 2 . That is, the (input,output) pairs for equal 
amplitude EPR states are 



input = |*(0) >= 



1 



V2 



(|01> 



jo i 



10 > (10) 



output = | < I0\^(t f ) > | 2 -» target = cos 2 (6>/2) 

For non-equal amplitude EPR states, and for the analogous 
partially entangled EPR states, we employ exactly analogous 
target functions as with the Bell states. For the EPR state, 
aoi|01 > +aioie l6 '|10 >, we take the target function for the 
output | < 10|#(*/) > | 2 to be: 

target E pR = 2(- - a 2 m ) 2 a 2 10 + 2a 01 a w cos 2 (6 / 2) (11) 

For the \EP X >= a 00 |00 > +a O i|01 > +ai O e ie |10 > state, 
the target function for the output | < 10|'J , (t/) > | 2 is: 

target EPl = 2|i - al \a 2 01 a 2 1Q + 3|^ - a^|ag a 2 (12) 

+2aoiaio cos 2 (0/2) 

For the \EP 2 >= a 01 \01 > +ai O e ie |10 > +an|ll > state, 
the target function for the output | < 10|*(i/) > | 2 is: 

target E p 2 = 2|i - al^a^ajo + 3|i - aoil a ii a ?o ( 13 ) 

+2a m a w cos 2 (6>/2) 
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Fig. 7. Results for the testing set on the phase offset 9, consisting of 
input states of three types: (1) aoi|01 > +ai u e I< ' > |10 > (blue triangles); 
(2) \EPx(0) >= [a o|00 > +a i|01 > +e ifl ai |10 > (green squares), 
and (3) \EP 2 (9) >= a i|01 > +e ie aio|10 > +aii|ll > (red circles.) 
Again, we show the deviation of the output from the target function (given 
in the text.) Average RMS error per pair is 0.0489. The line shows the goal 
(perfect agreement.) 

Results for testing on 550 randomly generated states of all 
three types are shown in Figure [7] 

If we can recover phase offset information on both |11 > 
and 1 10 > projections, we ought to be able to do so on |01 >. 
And so we can. Again we test only (no additional training), 
using, this time, the projection onto the 1 01 > state, and 
looking for information about the phase offset term multiplied 
by that basis state. Our target functions are the exact analogues 
to the \EPR > and \EP X .2 > targets. For the EPR state with 
the £ offset, aoie^jOl > +airj|10 >, the target function for 
the output | < 01|*(t/) > | 2 is: 

target EPRx = 2(i - a^fa^ + 2a 10 a 01 cos 2 (£/2) (14) 

For the \EP 3 >= a ie^|01 > +ai |10 > +an|ll > state, 
the target function for the output | < 01|^(t/) > | 2 is: 

target EP3 = 2|i - a^a^a 2 ,! + 3|^ - a^la 2 ^ (15) 

+2ai a iCos 2 (£/2) 

For the \EP 4 >= a 00 |00 > +a O ie i? |01 > +aio|10 > state, 
the target function for the output | < 01|*(i/) > | 2 is: 

target EPi = 2|i - al \a 2 w a 2 m + 3|^ - a? |ag a^ (16) 

+2ai a i cos 2 (£/2) 

Results are shown in Figure [8] for 550 randomly generated 
states. 

IV. Conclusion 

We have shown that a two-qubit quantum system, con- 
sidered as a trainable quantum neural net, can compute its 
own phase offsets. The training is not difficult: the training 
set consisted of only 11 training pairs, of a single type, and 
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Fig. 8. Results for the testing set on the phase offset £, consisting of 
input states of three types: (1) e^aoi|01 > +aio|10 > (blue triangles); 
l-EP 3 (0 >= «0ie i? |01 > +cii |10 > +an|ll > (green squares), and (3) 
(2) \EP 4 (0 >= a o|00 > +e^a i|01 > +ai |10 > (red circles.) Again, 
we show the deviation of the output from the target function. Average RMS 
error per pair is 0.0699. The line shows the goal (perfect agreement.) 

the set was trained for only 10 epochs. Agreement is not 
perfect, and, doubtless, a more complicated function could be 
devised such that better agreement would be reached. But if we 
are considering inverting these functions, in order to perform 
the rotations that would enable our use of the entanglement 
estimator discussed in the Introduction, simplicity is also 
important. Because our method relies on the phase offset's 
being on a basis state which carries entanglement, it is not 
completely general; however, since our goal is to be able to 
estimate the entanglement of a general input state, it does 
not really matter, since no phase correction is necessary to 
an unentangled state, and would make no difference to the 
calculation if made. 

Our previous work 0, |]5], which extended our work on 
entanglement in 2-qubit systems to n-qubit systems, seems 
to indicate that extension of our present results to multiple 
qubit systems should be possible without too much difficulty. 
The ease with which we are able to extract multiple angle 
information is encouraging. It should be not too difficult to 
perform the inverse rotations, and, thereby, to be able to form 
a good and reliable estimate for the entanglement with only a 
very few measurements, even for many-qubit systems. We are 
currently working on these calculations, and on the extension 
of our results to mixed systems. 
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